Xuezhi Cao

Alphabetical order by last name

ATLAS: All-round Testing of Long-context Abilities across Scales

May 27, 2026
Via arXiv

WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation

May 25, 2026
Via arXiv

LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment

Apr 13, 2026
Via arXiv

General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks

Apr 13, 2026
Via arXiv

TR-ICRL: Test-Time Rethinking for In-Context Reinforcement Learning

Apr 01, 2026
Via arXiv

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Mar 29, 2026
Via arXiv

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Mar 22, 2026
Via arXiv

AMemGym: Interactive Memory Benchmarking for Assistants in Long-Horizon Conversations

Mar 02, 2026
Via arXiv

LongCat-Flash-Thinking-2601 Technical Report

Jan 23, 2026
Via arXiv

UniHetero: Could Generation Enhance Understanding for Vision-Language-Model at Large Data Scale?

Dec 30, 2025
Via arXiv